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Abstract: Based on real data, a new parameterized model of the Main Drift Chamber response is proposed. In this 
model, we tune the ratio of good hits and the residual distribution separately. The difference between simulation and 
data in track reconstruction efficiency reduces from 1% to 0.5%, and the momentum resolution agreement improves 
significantly. 
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1 Introduction 

The Beijing Spectrometer III (BESIII) 1], which op- 
erates at the upgraded Beijing Electron Positron Collider 
(BEPCII), consists of the following sub-detectors: Main 
Drift Chamber (MDC), Time of Flight Counter (TOF), 
Electromagnetic Calorimeter (EMC), and Muon Counter 
(MUC). The MDC is the core sub-detector, measuring 
the decay vertex, energy loss and momentum of charged 
particles precisely. There are 6796 sense wires with a pos- 
itive high voltage (HV) , most of which are surrounded by 
8 field wires. A drift cell is defined as the sensitive re- 
gion of a sense wire. The whole MDC is in a 1.0 Tesla 
magnetic field provided by a super-conducting solenoid 
between EMC and MUC. 

The MDC plays an important role in reconstruct- 
ing charged tracks and identifying particles. A charged 
particle passing the drift cell ionizes the surrounding 
atoms of gas. The primary electrons drift towards the 
sense wire, being accelerated by electric field in the drift 
cell, and thereby initiating electron avalanches. These 
stripped electrons are eventually collected by sense wires 
and then produce electronic signals. If the magnitude of 



a signal exceeds a threshold value, relevant information 
will be recorded by the electronic system. This fired wire 
is called a hit. 

Since the first physics data taking in 2009, the peak 
luminosity of BEPCII has reached 7 x 10 32 cm _2 s _1 , 
which is about 70 times of BEPC. High luminosity and 
huge data sample reduce the statistical error of physical 
measurement significantly, and also call for a decrease 
of systematic uncertainty. A reliable Monte-Carlo (MC) 
simulation is important since event selection, efficiency 
calculation, and background estimation in the physics 
analysis, all rely heavily on it. Based on the GEANT4J2J 
package, the BESIII Object Oriented Simulation Tool 
(BOOST) 0| is developed on the framework of the BESIII 
Offline Software System (BOSS) [4]. For physics analysis, 
good consistencies of tracking efficiency and momentum 
resolution between MC and data are crucial. In this let- 
ter, we describe a new method based on hits category to 
improve the MDC MC tuning (short for "tuning"). 
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2 Tuning principles 

The tracking efficiency is positively related to the raw 
hit efficiency, which is the probability of sense wire firing 
when a charged particle passes though the drift cell. The 
momentum resolution is mainly contributed by intrinsic 
resolution and multiple scattering. The latter factor has 
been considered in GEANT4 package. The intrinsic reso- 
lution of MDC reflects the errors of particle position mea- 
surement. It contains the effects of primary ionization 
position, electron diffusion along the drift path, amount 
and positions of avalanches, and distortion of the electric 
field at the edges of the drift cell. These processes are 
complicated for simulation from the first principle due to 
the limitation of computing power. The spatial resolu- 
tion effect composes of intrinsic resolution contribution, 
multiple scattering, beam background, electronic noise, 
and tracking algorithm. 

Thus we build a parameterized model based on the 
raw hit efficiency and the spatial resolution for tuning. 
The values of parameters are extracted from the rele- 
vant distributions of real data. In order to get a better 
consistency, the iteration procedures are applied: using 
the model with initial values to generate simulation sam- 
ple; then, comparing its relevant distributions with real 
data's and adjust the input values. 

In practise, the spatial resolution is described by 
residual distribution. Therefore, raw hit efficiency and 
residual distribution are two key indicators when tun- 
ing. 

3 Previous method 

With the past experience from BESII 5], we find it 
difficult to obtain a consistency of the two key indicators 
simultaneously, by only tuning residual distribution. A 
better strategy is to tune the raw hit efficiency and the 
residual distribution separately [6]. 

3.1 Raw hit efficiency 

The raw hit efficiency is given by: 



3.2 Residual distribution 

The residual of a hit is defined as: 



" ("drift &docai 



(2) 



where d dri f t is the drift distance of the hit, calculated 
by drift time and distance-time relation (X-T relation); 
the DOC A, d coca , is calculated by the parameters of the 
helix — which are obtained from the other hits on this 
track. 

The residual distribution depends on layer, DOCA, 
and entrance angle. Thus, all hits are categorized by dif- 
ferent layers, DOCA, and entrance angles, and we tune 
each residual distribution one by one. Double-Gaussian 
is used to describe residual shape of signal. And a 2- 
order Polynomial is employed to describe the shape of 
background and noise. In simulation, the contribution of 
beam background has been considered by random back- 
ground sampling. Thus the polynomial part will not be 
tuned. As a result, there are five parameters for the 
residual distribution: (/, /i l5 a l7 ^ 2 , cr 2 ), in which / 
stands for the fraction of the first gaussian; fi ± , /i 2 stands 
for the mean values of the two gaussian; and Ci , er 2 stands 
for the standard deviations. 

3.3 Limitation and discussion 

In the previous method, the average of tracking effi- 
ciency difference could be reduced to 1%. Further com- 
parison shows that a part of residual distributions near 
the sense wire has a tail at positive side, especially in 
inner layers. 

The residual tends to be positive due to its definition 
(Eq. [2]) . As shown in Fig. [TJ typically, d doca will not be 
larger than ddrift- This kind of discrepancy is enlarged 
in inner layers, where the lower working voltage than de- 
signed is applied to protect detector from high level of 
background. It requires more primary ionization to trig- 
ger the electronics. This could make X farther from C. 
As a result, the residual could be much larger. 
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where N pas3 is the number of drift cells passed by track; 
N hit means the number of drift cells fired in N paas . We 
use helix parameters given by reconstruction algorithm 
to predict N paas . 

The e H is a function of cell position, distance of clos- 
est approach (DOCA), and dip angle. So we divide it 
into three parts: e celh e doca , and e cos e and tune them 
separately. 



Fig. 1. A schematic diagram of residual. W is a 
sense wire. A particle flies along AB. WC is 
DOCA. Solid dots on AB are the positions of pri- 
mary ionization. Suppose that X triggers the elec- 
tronics. WX is always longer than WC. 
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In order to verify the judgement, we select electron 
tracks from a Bhabha sample which is collected with full 
high voltage on inner layers. In Fig. [2j plot (a) is inner 
layer with normal voltage. Plot (b) is inner layer with 
full voltage. We also put an outer layer residual shape 
with full voltage as a reference (plot (c)). The conclu- 



sion is that with lower voltage on the wire, the tail tends 
to include more signal hits. And since these hits have 
no relationship with beam background, they can not be 
taken into account in neither previous tuning nor random 
background sampling. 
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Fig. 2. Residual distributions. Plot (a) is inner layer with lower HV — the tail contains signals with large residual, 
(b) is inner layer with full HV — the tail diminishes, (c) is outer layer with full HV. 



4 New method 



4[2 Revised model 



In order to improve the agreement between data and 
MC, a new method is developed to consider those signals 
with relatively larger residual. 

4.1 Hits category 

The residual distribution is obtained by track recon- 
struction algorithm[7J, |8(. The reconstruction algorithm 
drops those hits which make large contribution to \ 2 
when fitting. As a result, quality of a track is dominated 
by the hits adopted. In the previous method, we tune 
all hits as a whole. However, the consistency of the five 
parameters of double-Gaussian does not necessarily en- 
sure the agreement of the residual resolution of the hits 
on track. 

A natural way of classification of all hits is based on 
whether a hit is used by tracking reconstruction: the hits 
used eventually on track are good hits while those not 
used on track are bad hits. 

The good-hit efficiency should be one of the criteria 
of the track quality. It is defined as: 
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N pae3 is the same 



where N ghit is the number of good hits 

as in Eq.[TJ In the previous method, getting a better con 

sistency of the good-hit efficiency requires to broaden or 

shrink the shape of double-Gaussian. This will change 

the good hits residual distribution. Thus it is difficult 

to reduce the disagreement between simulation and data 

further. 



Since the raw hit efficiency is tuned before residual 
distribution, we retain the model of the raw hit effi- 
ciency, but rebuild a model of the residual distribution. 
We use a parameter Ratio to describe the proportion of 
good hits, a double-gaussian function with five parame- 
ters (two means, two standard errors and a fraction) to 
describe the residual distribution of good hits, and a uni- 
form distribution of large residual to describe the shape 
of bad hits. 

Because a new parameter, Ratio, is introduced, 
double- Gaussian only focus on good hits distribution. 
This not only reduces the impact on good-hit efficiency, 
but also makes the fitting easer since the shape becomes 
better (Fig. [3]). 



4.3 Tuning procedures 

The process of tuning raw hit efficiency keeps the 
same with the previous method. After that, we tune the 
residual distribution. Due to the complicated correlation 
among the parameters, we make an order by their sen- 
sitivity. The Ratio is tuned before the double-Gaussian 
shape. 
4.3.1 Obtaining the initial values 

We get the initial values of the 6 parameters from 
the residual distribution of real data. The Ratio is 
calculated by counting the numbers of good and to- 
tal hits (Fig. [3J (a)). The rest five parameters are ob- 
tained by fitting the good hits distributions (Fig. [3j(b)). 
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Fig. 3. (a) Two hit components: good hits 
(shaded) and bad hits (blank), (b) Fitting result 
of good hits distribution by double-Gaussian. 

4.3.2 Ratio tuning 

We generate a MC sample by these initial values and 
compare the value of Ratio with real data's. Based on 
the discrepancy, we change the input value while keep 
the other parameters fixed. 

4.3.3 Double-Gaussian identification 

In order to decide which /i or a should be modified, 
it is necessary to judge which Gaussian describing the 
certain part of the shape. Since a is changing during 
tuning, we use h as the identifier: 



h = f/a, 



(4) 



where / is the fraction of the Gaussian; a is the standard 
error. The Gaussian with larger h is labeled as g ll and 
the other one is g 2 - 

4.3.4 Gaussian shape tuning 

The most sensitive parameter of double- Gaussian is 
<7i since gi plays a main role in residual distribution. Af- 
ter several iterations, the main part of the residual shape 
keeps consistent. Then we tune <7 2 , /, /Ui, and ^ 2 in se- 
quence. 

4.3.5 Tuning performance 

We compare the raw hit efficiency and good-hit effi- 
ciency with real data after tuning, respectively (Fig. 2]). 
After tuning, the efficiencies of MC and data agree well. 
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Fig. 4. Tuning results and comparisons to data, 
(a) raw hit efficiency in each layer, (b) good-hit 
efficiency in each layer. 

Wc also compare the difference of root mean square 
(RMS) of residual distribution in the two method in each 
layer (Fig. [5]). After the improvement, the difference be- 
tween MC and data become much smaller. 
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Fig. 5. RMS difference between MC and data. Full 
triangles down stand for the previous method and 
open squares stand for the improved method. 



5 Check with physics objects 

In order to test the new method, we pick out some 
typical physics channels to compare results from simula- 
tion and real data. 

5.1 Tracking efficiency 

We use 7T from J/ip — > -k + it~it to check the difference 
of tracking efficiency. Tracking efficiency is defined as: 



N 



found 



N r . 



(5) 



where N recoi is number of events in which tt° and 
tagged charged pion have been reconstructed and 
the recoiled mass falls in the region of n; Nf ound 
is the number of events from N recoi in which the 
recoiled w is reconstructed successfully. Fig. [6] 
shows that the difference of tracking efficiency 
between MC and data is about 0.5% averagely. 
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Fig. 6. The tracking efficiency of n from J/ip — > 
7r + 7r~7r°. (a) tracking efficiency in different trans- 
verse momentum and (b) efficiency difference. 

5.2 Momentum resolution 

Protons from J/-0 —¥ pp are used to check the dif- 
ference of momentum resolution. Fig. [7] shows the mo- 
mentum distributions of the proton. The resolutions are 
listed in Table [TJ The new method has a significant im- 
provement. 



6 Summary 

In this work, we notice that signals could also cause 
large residuals. We employ a new category of hits, and 
revise the model of residual distribution. It can focus 
on the good-hit efficiency and good hits residual shape 
directly. 

Several advantages of the new method include: 

• Describing the real data better by splitting MDC 
hits into good and bad hits. 



10000 



5000 




10000 



5000 




(a) (b) 

Fig. 7. The momentum resolutions of proton, (a) 
previous method and (b) new method. Black dots 
are data and histograms are MC. 

Table 1. Momentum resolutions for data and MC. 
The difference decreases significantly. 

Previous/MeV New/MeV 



"MC 



""data 
CMC 

1(%) 



7.827±0.013 

7.097±0.011 

-9.327±0.002 



7.827±0.013 

7.639±0.035 

-2.402±0.005 



• Reducing the correlation of good hits efficiency and 
spatial resolution, which is helpful to get better 
agreements of the tracking efficiency and momen- 
tum resolution at the same time. 

• Accelerating the process of tuning by using an ef- 
ficient double-Gaussian identifier. 

Using the new method, the differences in track recon- 
struction efficiency and momentum resolution decrease 
significantly. In the future, more effort should be de- 
voted to understand the asymmetry near the sense wire 
and find out a better model to describe it. Also, the 
discrepancy of tracking efficiency at low momentum is 
relatively larger. Thus we should try to improve the 
agreement for low momentum tracks. If necessary, fur- 
ther improvement can be developed to consider other 
effects, such as electronic field leakage, dE/dx and so 
forth. 
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